COVID-19 and its potential effect on the election

As I previous justified in my blog post about electoral shocks and COVID-19, my model relied on polling and economic data to serve as proxies for any impact of COVID-19 on the 2020 election. Given the importance of economic fundamentals in determining election winners, it is not unreasonable to think that the pandemic and its toll on the economy must have hurt his bid for re-election. Considering all that we have learned over the past few months, it seems that Trump’s strong (pre-pandemic) economic record and his incumbent advantage could have reasonably carried him to another 4 years in the White House. In an alternate universe without COVID-19, would Donald Trump have won the 2020 election? In other words, did Trump lose because of COVID-19, or would Biden still have defeated the incumbent president in a COVID-free world?

Given his fairly large loss in 2016, a Trump victory in the popular vote would have been unlikely even without the pandemic, but the Electoral College might have been a different story. Several paths existed for a Trump electoral victory, but none of them panned out for him on Election Night. Donald Trump failed to secure Arizona, Georgia, and Wisconsin in 2020, despite having won all three of them in 2016. These are just a few of several states that were hit relatively hard by the pandemic and flipped from red to blue in 2020.

Why does it matter and how will I test it?

This narrative absolutely warrants some explanation: the Trump and Biden administrations would likely take very different approaches to policy making in 2021-2024. The 1918 Spanish influenza pandemic provides the only comparable situation to that which we face in 2020. In order to determine a methodologically sound approach, I structured my approach similar to past research on the Spanish flu’s impact on 1918 gubernatorial races.

Previous research on the 1918 midterms and 1920 general election suggests that the pandemic had negligible in any electoral impact.1 However, the national dialogue around the pandemic looks quite different now than it did a century ago. Relative to the magnitude of the pandemic, the Spanish flu received little public attention, which contrasts greatly with how COVID-19 has dominated nearly every facet of life in 2020. So, while we cannot automatically extend the conclusions from the 1918 pandemic to COVID-19, we can use similar methodology to take a preliminary look at COVID’s electoral impact.

In Democracy For Realists, Achen and Bartels examined whether the states and cities hit hardest by the pandemic responded differently at the polls.2 While they focused on gubernatorial races during the 1918 midterms, I plan on applying the underlying structure of their analysis to the 2020 presidential race. Similar to Achen and Bartels, I plan on running a simple regression that maps Donald Trump’s 2020 vote share from his 2016 vote share and COVID cases and/or deaths.3

Regression Data

For this regression, I had to gather a mixture of COVID, population, and voting data from several sources:

Regression Results

Achen and Bartels centered their analysis around excess flu deaths and vote share in the previous election, so ny initial regression mapped Trump’s 2020 state-level vote share from his 2016 vote share and total COVID deaths up to Election Day as a percentage of that state’s population. The p-value for the COVID deaths coefficient was quite large, but the same regression with cases instead yielded a much smaller, but still insignificant p-value. Both of these coefficients were positive, but the large p-values make it difficult to draw any conclusions from the regression. Perhaps, COVID was more of a deciding factor in battleground states?

Sure enough, the state-level regressions yielded more significant coefficients for the COVID terms when focusing solely on battleground states. The most significant coefficient was when looking at COVID cases in battleground states, which had a p-value of \(0.086\). If we want to interpret this coefficient with a significance level of \(\alpha = 0.10\), we can say that a 1% increase in a battleground state’s case count as a percentage of the population is associated with an approximate increase of \(0.57\)% of Trump’s 2020 two-party vote share within that state.

Comparing State-Level Models
  All States and COVID Deaths All States and COVID Cases Battleground States and COVID Deaths Battleground States and COVID Cases
Predictors Estimates p Estimates p Estimates p Estimates p
(Intercept) 0.98 0.497 1.36 0.282 2.04 0.647 5.62 0.206
Trump’s 2016 Two-Party Vote Share 0.95 <0.001 0.93 <0.001 0.90 <0.001 0.83 <0.001
Deaths as Percent of Population 3.36 0.644 26.03 0.141
Cases as Percent of Population 0.28 0.276 0.57 0.086
Observations 50 50 16 16
R2 / R2 adjusted 0.972 / 0.971 0.973 / 0.972 0.905 / 0.890 0.911 / 0.897

The significance of that term depends on how you select your \(\alpha\) value. However, whether you conclude its significant or not, it does reveal that there is possibly a positive association between Trump’s 2020 vote share in battleground states and higher total cases as percent of the population when controlling for Trump’s lagged vote share in that same state.

Next, I wanted to extend the analysis one step further and observe the same regressions with county-level COVID metrics and vote shares. Again, all of the coefficients indicated a positive relationship between Donald Trump’s 2020 vote share and increase in COVID cases or deaths as a percentage of the county’s population. This time, all of the slope coefficients yielded significant p-values at an \(\alpha = 0.001\) significance level, confirming the positive association: ADD NOTE ABOUT INCREASING SAMPLE SIZE AND DECREASING P-VALUE

Comparing County-Level Models
  All Counties and COVID Deaths All Counties and COVID Cases Counties in Battleground States and COVID Deaths Counties in Battleground States and COVID Cases
Predictors Estimates p Estimates p Estimates p Estimates p
(Intercept) -1.40 <0.001 -1.19 <0.001 0.73 0.057 1.18 0.004
Trump’s 2016 Two-Party Vote Share 1.00 <0.001 1.00 <0.001 0.97 <0.001 0.97 <0.001
Deaths as Percent of Population 11.22 <0.001 14.94 <0.001
Cases as Percent of Population 0.30 <0.001 0.28 <0.001
Observations 2978 2978 1252 1252
R2 / R2 adjusted 0.975 / 0.974 0.974 / 0.974 0.964 / 0.964 0.961 / 0.961

The below plots provide a clearer visualization of this positive association between COVID-19 and Trump’s 2020 vote share:

State-Level

County-Level

## `geom_smooth()` using formula 'y ~ x'

New Model Results

Taking an interest in the results of the regression, I decided to take a more nuanced look at the implications of these findings. The regressions take very crude measures of COVID numbers and previous vote share, without considering possible confounding variables. My previous election model used a mixture of demographic variables, economic metrics, incumbency status, and polling numbers to produce a probabilistic forecast for the 2020 election. While the forecast did not match the election results exactly, it did match the outcomes fairly closely, so it would not hurt to examine what happens without any measured impact of COVID-19.

COVID-19 bled into the polling and economic data used for the predictions, so I took steps to try to erase or minimize any impact of COVID-19 on these metrics:

I used a very similar5 model equation to that from my final forecast. In this hypothetical, pandemic-free world, Trump lost both the Electoral College and the national two-party popular vote by an even larger margin than what panned out on the actual election day:

Candidate Electoral Votes Two-Party Popular Vote
Biden 349 53.265 %
Trump 186 46.735 %

Do these results support the narrative?

While these tests were imperfect measures, they indicate that coronavirus likely did not hurt, and may have even helped, Trump in his 2020 election bid. While I cannot conclude that COVID-19 caused Trump to perform better in the 2020 election, these preliminary measures do indicate that there is a positive association between COVID numbers and Donald Trump’s vote share.



  1. [Achen and Bartels, 2017] Achen, C. H. and Bartels, L. M. (2017). Democracy for realists: Why elections do not produce responsive government↩︎

  2. [Achen and Bartels, 2017] Achen, C. H. and Bartels, L. M. (2017). Democracy for realists: Why elections do not produce responsive government↩︎

  3. Achen and Bartels also included a dummy variable that indicated whether the specified gubernatorial candidate was a Democratic incumbent. Since Donald Trump ran in both 2016 and 2020, I did not include an indicator for incumbency and instead just focused on his vote share for both races.↩︎

  4. While this includes some data from after COVID-19 came to the United States, I had to expand the window of time in order to get a large enough sample size for the model to run for each state.↩︎

  5. I added an interaction term to the model from my original forecast for this iteration. In retrospect, it did not make sense to not include it in the first place since the state of the economy likely has opposite effects for incumbent and non-incumbent candidates.↩︎